Cost-based Feature Selection for Network Model Choice

نویسندگان

چکیده

Selecting a small set of informative features from large number possibly noisy candidates is challenging problem with many applications in machine learning and approximate Bayesian computation. In practice, the cost computing also needs to be considered. This particularly important for networks because computational costs individual can span several orders magnitude. We addressed this issue network model selection using two approaches. First, we adapted nine feature methods account features. show classes models that reduced by magnitude without considerably affecting classification accuracy (proportion correctly identified models). Second, selected pilot simulations smaller networks. approach factor 50 accuracy. To demonstrate utility our approach, applied it three different yeast protein interaction best-fitting duplication divergence model. Supplementary materials, including computer code reproduce results, are available online.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Novel Architecture for Detecting Phishing Webpages using Cost-based Feature Selection

Phishing is one of the luring techniques used to exploit personal information. A phishing webpage detection system (PWDS) extracts features to determine whether it is a phishing webpage or not. Selecting appropriate features improves the performance of PWDS. Performance criteria are detection accuracy and system response time. The major time consumed by PWDS arises from feature extraction that ...

متن کامل

Linear Correlation-Based Feature Selection For Network Intrusion Detection Model

Feature selection is a preprocessing phase to machine learning, which leads to increase the classification accuracy and reduce its complexity. However, the increase of data dimensionality poses a challenge to many existing feature selection methods. This paper formulates and validates a method for selecting optimal feature subset based on the analysis of the Pearson correlation coefficients. We...

متن کامل

Fast SFFS-Based Algorithm for Feature Selection in Biomedical Datasets

Biomedical datasets usually include a large number of features relative to the number of samples. However, some data dimensions may be less relevant or even irrelevant to the output class. Selection of an optimal subset of features is critical, not only to reduce the processing cost but also to improve the classification results. To this end, this paper presents a hybrid method of filter and wr...

متن کامل

A Genetic Algorithm for Choice-Based Network Revenue Management

In recent years, enriching traditional revenue management models by considering the customer choice behavior has been a main challenge for researchers. The terminology for the airline application is used as representative of the problem. A popular and an efficient model considering these behaviors is choice-based deterministic linear programming (CDLP). This model assumes that each customer bel...

متن کامل

An Energy-based Model for Feature Selection

In this paper we propose an energy-based model (EBM ) for selecting subsets of features that are both causally and predictively relevant for classification tasks. The proposed method is tested in the causality challenge, a competition that promotes research on strengthen feature selection by taking into account causal information of features. Under the proposed approach, an energy value is assi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of Computational and Graphical Statistics

سال: 2023

ISSN: ['1061-8600', '1537-2715']

DOI: https://doi.org/10.1080/10618600.2022.2151453